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Echo cancellation techniques have been widely 
used in baseband digital transmission systems to 
provide full duplex data traffic over a two-wire 
network. The implementation of an echo canceler 
using the H/EDSP16/DSP16A Digital Signal 
Processor (DSP) is given in the first section of this 
application note. The DSP16/DSP16A is a general- 
purpose, 16-bit DSP. The echo canceler is based 
on an adaptive transversal filter using the least 
mean square (LMS) algorithm to update the filter 
coefficients. Two examples are given: one employs 
single-precision filter coefficients as well as single- 
precision algorithm calculations; the other uses 
double-precision arithmetic, except that the input 
and output data are represented as single-precision 
words. The trade-offs between the two systems are 
the data rate and echo rejection. The double- 
precision implementation can offer better echo 
rejection while the single-precision design allows a 
system with a higher data rate. 

Complex adaptive equalizers are a widely used 
building block in digital communications systems. 
Their function is to equalize channel distortions in 
order to reduce intersymbol interferences (ISI) at the 
receiver. The second section of this application note 
describes complex adaptive equalizer 
implementations for both linear equalizers (LE) and 
decision feedback equalizers (DFE) using the 
DSP16/DSP16A. The LMS adaptation algorithm is 
employed to update the equalizer coefficients. Each 
equalizer is tested by applying a 16-point quadrature 
amplitude modulated (16-QAM) signal transmitted 
through a long-haul voiceband telephone channel. 
For an 8-tap LE, the DSP16/DSP16A program takes 
a maximum of 239 cycles to process each input data 
symbol. For a DFE consisting of a 3-tap forward 
equalizer and a 5-tap feedback equalizer, the 
program execution takes 265 cycles. Using a 55 ns 
DSP16/DSP16A processor, the throughputs for the 
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linear and decision feedback equalizers are under 
14 lis and 15 (is, respectively. 

Digital Echo Canceler 
Implementation 

A two-wire, full duplex baseband data transmission 
system usually relies on an echo canceler to 
decouple the outgoing transmitted signal and the 
incoming received signal. An example of this 
system is the integrated service digital network's 
(ISDN) U interface which supports 160 Kbit/s (80K 
baud using 2B1Q - two binary to one quaternary - 
line code) data rate on a subscriber loop over 
distances up to 5 Km [1]*. The application of the 
echo canceler in systems of various data rates and 
distance coverages has also been reported [2,3]. In 
most systems, the worst case signal attenuation of 
40 — 50 dB can be expected. Therefore, in order to 
achieve a signal-to-echo ratio of 20 dB as required 
for low bit error rate (BER) transmission, the echo 
canceler has to provide 60—70 dB echo rejection. 

The echo canceler can be implemented as an 
adaptive transversal filter which automatically 
adapts its coefficients to the impulse response of the 
echo path. If the echo path is a linear system and 
the transversal filter contains sufficient taps to cover 
the entire period of the echo's impulse response, the 
echo replica generated by the transversal filter can 
completely cancel out the echo signal from the 
received signal. However, due to the finite precision 
of the DSP and the tap misadjustments inherent to 
the adaptation algorithm, a small residual echo 
always exists. 

The LMS algorithm [4] is the most popular algorithm 
used in adaptive transversal filters. The 
computational requirements (measured by the 
multiplication/addition operation) is 2N+1 operations 
per data sample, where N is the number of taps. 
The implementation of the LMS adaptive filter using 
a fixed-point processor introduces two major 
sources of error: one is from the quantization error in 
the filter coefficients, and the other arises when the 
tap updates become smaller than the least 
significant digit of the processor. This causes the 
adaptation process to stall. 

In this section, a 48-tap echo canceler implemented 
using both single-precision and double-precision 
arithmetic is described. Both implementations are 
tested using a simulated two-wire circuit which 
consists of a 22-gauge 10 Km loop, the termination 
network, and the transmit and receive filters. The 



results are compared with those obtained from using 
floating point arithmetic. 

Digital Eoh© Canceler for Baseband Data 
Transmission 

A two-wire data transmission system with echo 
cancellation is shown in Figure 1 . Let the input 
signal sequence to the echo canceler be a(n), and 
the coefficients of the transversal filter be c(n). Then 
the echo replica at the output of the echo canceler 
is: 

1) S(n) = £ 1 Ci (n)a(n-i). 

i=0 

The input to the receiver is the the far-end signal 
s(n) and the echo signal e(n). At the echo canceler 
output, the received signal is : 

2) r(n) = s(n) + e(n)-£(n). 

The mean square residual echo E((e(n) - £(n)) 2 ) 
can be minimized by reducing the received signal 
E(r 2 (n)); if s(n) and e(n) - 6(n) are uncorrelated. 
Using the LMS algorithm, the coefficient adaptation 
follows: 

3) Ci(n+1) = Cj(n) + |ir(n)a(n). 

In order to achieve a high degree of cancellation 
and due to the far-end signal acting as noise to the 
adaptation process, the convergence factor jll must 
be set to a small value to reduce the coefficient 
misadjustment at steady-state. However, a small 
value of |jl results in slow convergence. Also, as 
mentioned earlier, the echo canceler utilizing the 
adaptive transversal filter technique is only capable 
of canceling the linear echo term. The non-linear 
echo that might be introduced by a buffer amplifier, 
hybrid transformer, analog-to-digital converter, etc. 
will remain uncanceled. 

Because of the timing requirements in the data 
transmission system, e.g., the ISDN, the echo 
canceler of data symbol rate is usually not 
adequate. Since the input rate is equal to symbol 
rate, this implies that the echo canceler should have 
a higher output rate than the input rate. Figure 2 



[] indicates a reference listed at the end of this section, pg. 6. 



Digital Echo CasieeSer and Complex Adaptive Equalizers 




Figure 1. Diagram of Two-Wire Data Transmission System With Echo Canceler (ec) 




ATF = Adaptive Transversal Filter. 

Figure 2. Block Diagram of Echo Canceler With Interpolation Factor R = 4 
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shows the block diagram of an echo canceler with 
an interpolation factor of 4 (R=4). It consists of four 
independent transversal filters, each 12 taps. The 
input to the echo canceler is the transmitted data 
symbol sequence of rate 1/T. The output rate is 4/T, 
which corresponds to the echo replica of four 
samples in one symbol interval. 

DSP16/DSP16A implementation of the 
Echo Canceler 

The implementation of the echo canceler using the 
DSP16/DSP16A processor involves programming 
the three operations described in equations (1) — (3) 
according to the block diagram in Figure 2. To 
perform the convolution of equation (1) using the 
DSP16/DSP16A, all multiplication products must be 
represented exactly using a 32-bit product register 
and rounding should be performed only after they 
are summed (in a 36-bit accumulator), i.e., at the 
filter output.* In this case, the output roundoff noise 
is uniformly distributed between -Q/2 and Q/2 with a 
mean of zero and a variance of Q 2 /12, where Q is 
the quantization step size. The same roundoff noise 
is true for equation (2). In equation (3), if c is 
represented as a 16-bit word, any tap update 
smaller than the value rounded to a 16-bit word will 
be dropped, and the adaptation process cannot go 
any further to reach the minimum mean square 
error. In other words, adaptation stops when 

4) least significant digit of the coefficient > jir(n)a(n). 

Therefore, in a fixed-point implementation of an 
adaptive transversal filter, there exists an optimum jn 
at which the inherent tap misadjustment is equal to 
the least significant digit of the coefficient. 

If the level of cancellation cannot be met by single- 
precision arithmetic, double precision coefficients 
and coefficient update algorithm should be used 
instead. Using double-precision, the adaptation 
stopping criteria, equation (4), can be reduced by a 
factor of 2 16 . However, the roundoff noise from the 
transversal filter should remain unchanged since the 
data in both cases is rounded off to 16-bit words. 
The DSP16/DSP16A is a 16-bit device which 



features a 16-bit data bus, a 16x16 2's complement 
multiplier that generates a full 32-bit product, and 
two 36-bit accumulators. To use double-precision 
(32-bit) coefficients, the 32-bit word must be broken 
into two 1 6-bit words for storage as well as 
arithmetic operations. One way to do this is to 
represent a 32-bit number in accumulator aO as 

5) aO = aO M sw + aO LS w2" 16 

where aO M sw is the 2's complement most significant 
word and aO LS w is the 2's complement least 
significant word of aO defined by the following 
equations: 

6) aO MS w = md(aO); 

rnd() = DSP16/DSP16A rounding instruction 

7) aO LSW = aOI2 16 ; 

aOI=lower half of a 32-bit word 

This method utilizes three DSP16/DSP16A special 
function instructions: rounding, right shift 16 bits, 
and left shift 16 bits. There is no need to perform 
software unsigned multiplication that may be 
required using other approaches. 

The program listings of echo cancelers using both 
single-precision and double-precision arithmetic are 
at the end of this section. 

Simulation Results ©f the Eeh© Caooelers 

The echo canceler programs are tested in the envl 6 
(an integrated DSP16/DSP16A assembler and 
simulator) environment. The input sequence to the 
echo canceler is a 19.2 Kbit/s random data 
sequence using bipolar coding. The echo path 
impulse response is shown in Figure 3, which 
corresponds to a 22-gauge, 10 Km loop with 
termination network and the transmit/receive filters. 
The far-end impulse response is shown in Figure 4. 
The echo canceler used in this case has a total of 
48 taps and an interpolation factor R=4. The 
maximum impulse response duration is 12 input 
symbol intervals. The results of single-precision, 



Rounding instead of truncation should be performed in 
equations (1) — (3) in order to obtain the zero mean statistics; 
also, the bias as a result of numerical truncation in the 
adaptive filter coefficients can seriously degrade filter 
performance. 
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Figure 3. Echo Path Impulse Response 

double-precision, and floating-point* 
implementations of the same echo canceler are 
listed in Table 1 . 

Table 1. Signal to Echo Ratio (dB) vs 
Iterations 



No. of 


Floating- 


Single- 


Double- 


Symbols 


Point 


Precision 


Precision 





M>.016 


MJ.016 


M).016 


-29.06 


-29.06 


-29.06 


2000 


-9.79 


-9.82 


-9.79 


4000 


2.43 


1.90 


2.43 


6000 


4.81 


4.00 


4.81 


8000 


5.40 


4.67 


5.40 


10000 


5.65 


5.09 


5.66 


12000 


5.70 


5.01 


5.71 


14000 


^1=0.00025 


|i=0.01 


u.=0.00025 


11.46 


7.09 


11.40 


16000 


18.69 


7.27 


18.56 


18000 


20.04 


7.21 


20.01 



.008 
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Figure 4. Far-End Impulse Response 

double-precision implementations achieved the 20 
dB signal-to-echo ratio. Notice that the results 
between the floating-point and double-precision 
implementations are very close. The slight 
discrepancy can be attributed to the quantization 
noise after rounding the echo canceler output (r(n)) 
to a 16-bit word. By imposing the criteria described 
by equation (4) on ji, the minimum \i that can be 
used without making the adaptation process stop is 
0.01 in the single precision implementation. The 
final echo level is 7.21 dB below the received far- 
end signal. This is also close to 7.77 dB obtained 
using floating-point arithmetic with the same \i. 

The single-precision implementation of a 48-tap 
echo canceler takes 445 machine cycles while the 
same canceler with double-precision arithmetic 
takes 1065 cycles to process each input data 
symbol and output four samples of far-end signal 
without echo. For a 33 ns version DSP1 6A, the data 
output rate is 3.7 and 8.8 |u,s, respectively. The 
single-precision implementation requires 101 words 
of ROM and 59 words of RAM. On the other hand, 
177 words of ROM and 107 words of RAM are 
needed using double-precision program. 



At the start of adaptation, all coefficients are set to 
zero. The initial \i is chosen to be 0.016 for fast 
convergence, but the steady-state signal-to-echo 
ratio is only 5.7 dB. By using a smaller \i to continue 
the adaptation process, both floating-point and 



The floating-point results are obtained from a C program on a 
SUN 3 computer using 32-bit single-precision floating-point 
arithmetic. 
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Conclusion 

The implementation of two digital echo cancelers for 
baseband data transmission using the 
DSP16/DSP16A DSP is described in this application 
note. The single-precision echo canceler has a 
throughput advantage and occupies less memory 
space than its double-precision counterpart. 
However, the double-precision echo canceler offers 
much greater degree of cancellation and is limited 
only by the 16-bit quantization noise. The speed and 
flexibility of the DSP16/DSP16A device lends itself 
to the implementation of echo cancelers in two-wire 
full duplex baseband data transmission systems. 
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Program Listing 1. Echo Canceler - Single-Precision 



/* */ 

/* Digital Echo Canceler for Two-Wire Data */ 

/* Transmission - Single Precision Coefficients */ 

/* */ 

/* by S. Tu, AT&T Bell Laboratories, Allentown, PA 10/28/88 */ 

/* */ 



. ram 

datl: 

datl2: 

cfl: 

cf2: 

cf3: 

cf 4: 

rx: 

. endram 

start : 



ll*int 
int 
12*int 
12*int 
12*int 
12*int 
int 



auc=0x02 

pt=CFl 

r2=cfl 



do 48 { 



*r2++=x 
} 

rl=datl2 
r3=rx 
rb=datl 
re=datl2 

J — 12 
i=0 
c0 = 



/* allocate 12 ram spaces for 12-tap 

/* delay lines 

/* 12 coef. for sub-canceler 1 

/* 12 coef. for sub-canceler 2 

/* 12 coef. for sub-canceler 3 

/* 12 coef. for sub-canceler 4 



*/. 

*/ 

*/ 

*/ 

*/ 

*/ 



y=a0 



x=*pt++ 



/* set alignment for Q14 format */ 

/* pt - coef. in rom */ 

/* r2 - coef. in ram */ 

/* transfer start-up coef. */ 

/* from rom to ram (y=a0 is a */ 

/* dummy fetch) */ 



/* rb points to beginning, re */ 
/* to end of modulo. */ 



/* initialize counter cO to 



loop: *r3=sdx 



/* read s(n)+e(n). from SIO 
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cnvl : 



if cOlt goto cnvl 

*rl++=pdxO 

c0=-3 

r2=cfl 



aO=p 
aO=aO-p 



do 11 { 



aO=aO+p 
} 



p=x*y 



x=*r2++ 
y=*rl++ 

x=*r2++ 
y=*rl++ 



/* read a (n) from PIO every */ 

/* 4th loop. */ 

/* set cO to -3 */ 

/* start convolution routine */ 

/* load x, y and clear aO . */ 

/* perform convolution to */ 

/* obtain echo estimate. */ 



p=x*y 



lms : 



end: 



CF1: 



aO=aO+p 

y=*r3 
aO=aO-y 
aO=-aO 
aO=rnd (aO) 
sdx=aO 



p=x*y 
aO=p 

aO=rnd(aO) 
y=aO 



do 11 { 



*r2++j 



y=aO 
x=*rl++ 



x=*pt++i 





p=x*y a0=*r2 


aO=a 


0+p x=*rl++ 


aO=rnd(aO) 


*r2++=a0 


> 


p=x*y a0=*r2 


aO=a 


0+p 


aO=rnd (aO) 


*r2++=a0 


goto 


loop 




/* fi. 


int 


0.00043 


int 


-0.00098 


int 


0.00177 


int 


0.00360 


int 


0.00592 


int 


0.00879 


int 


-0.00037 


int 


0.14478 


int 


0.01013 


int 


-0.00085 


int 


-0.00079 


int 


0.0 


int 


0.00055 


int 


-0.00220 


int 


0.00153 


int 


0.00323 


int 


0.00500 


int 


0.00983 


int 


0.02832 


int 


0.30933 


int 


0.01343 


int 


-0.00128 


int 


-0.00110 


int 


0.00043 


int 


0.00055 



/* end of convolution routine 

/* subtract estimated echo 

/* from received signal then 

/* output the result to SIO 

/* after rounding 



*/ 
*/ 
*/ 

*/ 



/* start tap adaptation routine */ 

/* scale aO by the factor |1 */ 

/* then load to y for tap */ 

/* update calculation */ 



/* update filter coefficients 



/* end of adaptation routine 



*/ 



/* filter final coef. settings */ 
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int 


-0.00189 


int 


0.00116 


int 


0.00269 


int 


0.00421 


int 


0.00885 


int 


0.03125 


int 


0.24213 


int 


-0.03455 


int 


-0.00446 


int 


-0.00098 


int 


0.00031 


int 


0.00012 


int 


-0.00049 


int 


0.00055 


int 


0.00256 


int 


0.00378 


int 


0.00757 


int 


0.01325 


int 


0.06293 


int 


-0.02692 


int 


-0.00574 


int 


0.0 


int 


0.00037 


int 


0.01 



Program Listing 2. Echo Canceler - Double Precision 



/* 
/* 
/* 
/* 
/* 
/* 



Digital Echo Canceler for Two-Wire Data 
Transmission - Double Precision Coefficients 

by S. Tu, AT&T Bell Laboratories, Allentown, PA 11/28/88 



. ram 

datl: 

datl2: 

cfl: 

cf2: 

cf3: 

cf 4: 

scl : 
rx: 
. endram 

start : 



ll*int 

int 

24*int 

24*int 

24*int 

24*int 

2*int 

int 



auc=0x02 

pt=CFl 

r2=cfl 



do 9 6 { 



} 



*r2++=x 



/* allocate 12 ram spaces for 12-tap 

/* delay lines 

/* 12 coef. for sub-canceler 1 

/* 12 coef. for sub-canceler 2 

/* 12 coef. for sub-canceler 3 

/* 12 coef. for sub-canceler 4 



*/ 

*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



y=a0 



x=*pt++ 



/* set alignment for Q14 format */ 

/* pt - rom coef. pointer */ 

/* r2 - ram coef. pointer */ 

/* transfer start-up coef. */ 

/* from rom to ram (y=a0 is a */ 

/* dummy fetch) */ 



r0=scl 

rl=datl2 

r3=rx 

rb=datl 

re=datl2 

j— 24 

i=0 

c0=0 

loop: *r3=sdx 



/* rb points to beginning, re 
/* to end of modulo. 



/* initialize counter cO to 
/* read s (n) +e (n) from SIO 
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if cOlt goto cnvl 

*rl++=pdxO 

c0=-3 

r2=cfl 



cnvl : 



aO=p 

aO=aO-p 

al=aO 



do 11 { 



aO=aO+p p=x*y 

al=al+p 

} 

p=x*y 
aO=aO+p p=x*y 
al=al+p 



aO=aO»16 

y=al 

al=al«16 

yl=al 

aO=aO+y 

aO=aO-y 
aO=-aO 
al=rnd(aO) 
sdx=al 



lms : p=x*y 

aO=p 

*r0++=a01 
aO=rnd(aO) 
*r0 — =a0 
move y=*rO++ 

do 11 { 

p=x*y 
aO=p p=x*y 
a0=a0»16 
aO=aO+p 

al=rnd(aO) 
aO=aO«16 

aO=aO+y 
aO=aO»16 

al=al+y 

aO=rnd(aO) 

y=aO 

al=al+y 

*r2++=al 

} 

p=x*y 
aO=p p=x*y 
aO=aO»16 
aO=aO+p 

al=rnd(aO) 
aO=aO«16 



x=*r2++ 
y=*rl++ 



x=*r2++ 
y=*rl++ 
x=*r2++ 

x=*r2++ 

x=*r2++j 



y=*r3 



y=al 



x=*rl++ 



y=*rO — 
x=*rl++ 



y=*r2++ 



y=*r2 — 

*r2++=a01 

y=*rO++ 

y=*rO — 
y=*r2++ 



/* read a(n) from PIO every 

/* 4th loop iteration. 

/* set cO to -3 

/* start convolution routine 

/* load x, y and clear aO,al 



/* perform double precision 
/* convolution to obtain echo 
/* estimate 



*/ 
*/ 
*/ 

*/ 
*/ 



/* al - tap update (MSW) 
/* aO - tap update (LSW) 

/* update the LSW of the coef . 



/* update the MSW of the coef. 



*/ 
*/ 
*/ 



/* aO- convolution output (LSW) */ 
/* al- convolution output (MSW) */ 
/* end of convolution routine */ 



/* align aO with al */ 

/* transfer alh to yh first, */ 

/* then all to yl . */ 

/* add aO and al to yield echo */ 

/* estimate; fetch s(n)+e(n) */ 

/* subtract estimated echo */ 

/* from received signal, then */ 

/* output result to SIO after */ 

/* rounding */ 

x=*pt++i /* fetch canceler output to y, */ 

/* convergence factor to x */ 

/* start lms adaptation routine */ 

/* multiply x and y to yield 32 */' 

/* bit product; convert p to two */ 

/* 16 bit words and store in */ 

/* ram location pointed to by rO */ 



/* update filter coefficients */ 

/* perform 32 x 16 -> 32 */ 

/* multiplication to yield 32 */ 

/* bit coefficient update */ 



*/ 
*/ 

*/ 
*/ 
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end: 



CF1: 



aO=a 


0+y y=*r2 — 


aO=a 


0»16 


al=a 


1+y *r2++=a01 


aO=rnd(aO) 


y=aO 




al=a 


1+y 


*r2++=al 


goto 


loop 




/* filt 


int 


0.83484 


int 


-0.00024 


int 


-1.17218 


int 


0.00037 


int 


-1.12122 


int 


0.00177 


int 


-0.19501 


int 


0.00336 


int 


0.58185 


int 


0.00610 


int 


-1.71277 


int 


0.00873 


int 


0. 93817 


int 


0.00012 


int 


-0.39166 


int 


0.14514 


int 


-1.84900 


int 


0.01007 


int 


-1.12067 


int 


-0.00043 


int 


-1.42810 


int 


-0.00055 


int 


-1.91608 


int 


-0.00024 


int 


-0.77722 


int 


-0.00055 


int 


-1.00397 


int 


-0.00024 


int 


1.43420 


int 


0.00104 


int 


-1.47412 


int 


0.00256 


int 


1.48413 


int 


0.00464 


int 


-1.12616 


int 


0.00934 


int 


-0.84863 


int 


0.02875 


int 


-1.06116 


int 


0.30914 


int 


1.70483 


int 


0.01263 


int 


0.95929 


int 


-0.00055 


int 


0. 95587 


int 


-0.00092 


int 


1.71930 


int 


-0.00037 


int 


-0.64282 


int 


-0.00049 


int 


0.57837 


int 


-0.00024 


int 


-0.49121 


int 


0.00092 


int 


0.26746 



/* end of adaptation routine 



*/ 



filter final coef . settings */ 
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int 


0.00220 


int 


-0.26996 


int 


0.00421 


int 


0.10040 


int 


0.00818 


int 


-0.78595 


int 


0.03174 


int 


-0.57611 


int 


0.24237 


int 


-0.25562 


int 


-0.03516 


int 


1.80664 


int 


-0.00372 


int 


-1.50757 


int 


-0.00055 


int 


0. 93994 


int 


-0.00031 


int 


-1.84192 


int 


-0.00012 


int 


-0.39398 


int 


0. 


int 


1.40552 


int 


0.00092 


int 


0.29498 


int 


0.00226 


int 


1.03375 


int 


0.00403 


int 


1.17072 


int 


0.00757 


int 


-1.82428 


int 


0.01349 


int 


-0.10437 


int 


0.06342 


int 


-0.08191 


int 


-0.02716 


int 


0.14935 


int 


-0.00543 


int 


1.98663 


int 


0. 


int 


-0.93744 


int 


-0.00018 


int 


0.00025 



Complex Adaptive Equalizers 

Transmitting high bit-rate data successfully through 
a switched telephone network, microwave or 
satellite link, etc., often requires adaptive 
equalization to remove excessive channel 
distortions (i.e., amplitude and phase distortion or 
multipath fading distortion). The adaptive equalizer 
is usually implemented using transversal filters with 
certain algorithms controlling the update of the 
coefficients [1].* The two most popular structures 
used in adaptive equalization are the LE and DFE. 



The LE, shown in Figures 5 and 6, derives its output 
by linearly combining the past received signals 
weighted by the coefficients. The DFE, shown in 
Figures 7 and 8, is comprised of both a forward 
equalizer and a feedback equalizer. The output of 
the DFE is the sum of the two equalizers. The DFE 
is found to be particularly effective when the channel 
is highly distorted. In addition, it offers the 
advantages of being less sensitive to symbol timing 
accuracy and having less noise enhancement 
problems, as compared to linear equalizers. 



[] indicates a reference listed at the end of this section, pg. 
16. 
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In this section, the implementations of an 8-tap 
complex linear equalizer and a DFE of 3 forward 
taps and 5 feedback taps using the DSP16/DSP16A 
processor are presented. Included is a description of 
the system blocks and the algorithm, the 
DSP16/DSP16A programs, and the testing results of 
the equalization of a 16-QAM signal sent through a 
dispersive voiceband telephone channel. 



x2 



COMPLEX 
ADAPTIVE FILTER 



e1 



■^^0— 



hr-9- 



d1 



d2 



e2 



System Description and Algorithm 

Figure 5 shows the block diagram of a complex 
adaptive equalizer. Inputs x1 and x2 correspond to 
the real and imaginary data paths. Outputs y1 and 
y2 of the complex transversal filter (Figure 6) are 
given in the following equations: 

8) y1(n)= Edi(n)x1(n-i)- 2c2i(n)x2(n-i); 

i=0 i=0 

N = number of taps 

9) y2(n)= Ec2i(n)x1(n-i)-f Sdi(n)x2(n-i) 

i=0 i=0 

The well-known LMS algorithm [2] is used to update 
coefficients d and c2 to minimize the mean square 
error, E[ e1 2 + e2 2 ]. With a quadratic error function, 
the minimum mean square error is obtained where 



Figure 5. Complex Linear Equalizer Block 
Diagram 



Cln 



X1 



C2 n 



c2 n 



x2 




Figure 6. Complex Transversal Filter 
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the gradient is zero. The estimated gradient for d 
is e1x1 + e2x2 and for c2 is e2x1 - e1x2. The 
coefficient adaptation follows 

1 0) d j(n+1 )=cl j (n)+ mu[e1 (n)x1 (n-i)+e2(n)x2(n-i)] 

11) C2j(n+1)=c2i(n)+mu[e2(n)x1(n-i)-e1(n)x2(n-i)]. 

Error signals e1 and e2 are derived from the 
difference between the filter outputs and the desired 
outputs (d1 and d2). When the filter is used as a 
data equalizer, d1 and d2 are obtained from the 
quantized value of y1 and y2. In the case of a 16- 
QAM system, d1 and d2 can be represented by 2 
bits. Therefore, a 2-bit quantizer is employed. The 
parameter \i is the convergence factor which 
determines the speed of convergence and the 
misadjustment of adaptation process. 

Figures 7 and 8 show block diagrams of a complex 
adaptive DFE which consists of forward and 
feedback equalizers. The forward equalizer is used 
to equalize the pre-cursor ISI, while the feedback 
equalizer cancels the post-cursor ISI. The outputs, 
which correspond to the real and imaginary data 
paths of the DFE, are z1 = f 1 + y1 and 
z2 = f2 + y2. The forward equalizer outputs, y1 
and y2, and the tap update equations are the same 
as those for the LE. The feedback equalizer 
outputs, f 1 and f2, and coefficient adjustments are 
given in the following equations: 

M M 

12) f1(n) = 2b1i(n)d1(n-i) _ 2b2j(n)d2(n-i); 

i=1 i=1 

M = number of taps 

M M 

13) f2(n) = Xb2j(n)d1 (n-i) + £b1j(n)d2(n-i) 

i=i i=i 

14) b1 j (n+1 ) = b1 i(n) + ji[e1 (n)d1 (n-i) + e2(n)d2(n-i)] 

15) b2j(n+1) = b2j(n) + fi[e2(n)d1(n-i) - e1 (n)d2(n-i)] 

All coefficients in the forward and feedback 
equalizer are adjusted simultaneously using the 
LMS algorithm to minimize the mean square error, 
E[e1 2 + e2 2 ]. 

One problem unique to the DFE occurs when either 
d1 or d2 is detected incorrectly. In this case, the 
erroneous data stays in the feedback equalizer for 
several symbol periods, depending on the length of 
the tap. As a result, the incorrect data may cause 
successive errors. This effect is known as error 
propagation. 



DSP16/DSP16A Implementation of 
Complex Adaptive Equalizers 

The implementation of either the LE or DFE using 
the DSP1 6/DSP1 6A processor involves 
programming the convolution and tap adaptation 
algorithms described by equations (7)— (15) and the 
quantizer for obtaining signals d1 and d2. Before 
the program enters into the main loop, the 
initialization procedure transfers the start-up 
coefficient settings and the constants used by the 
quantizer from ROM to RAM. Although the 
constants can be left in ROM, they can be fetched in 
one cycle when they are stored in RAM, whether the 
instructions used to fetch the constants are 
executed from within the cache or not. The data is 
input and output via the serial I/O (SIO) port. Real 
and imaginary data inputs are transferred from SIO 
register sdx to data memory and then moved to the 
tapped delay line before each new iteration begins. 
Since the complex adaptive equalizer requires two 
consecutive input/output operations for synchronous 
data transfer, the sdx read/write commands are 
carefully placed within the program to insure that all 
data is shifted in/out before the next I/O. 
Asynchronous I/O can also be used by adding a 
simple polling routine before the sdx read/write 
command. (Refer to the WE® DSP16 and DSP16A 
Digital Signal Processors Information Manual for an 
example of a polling routine.) 

In the LE, the convolution calculation of both data 
paths, described by equations (8) and (9), is 
executed from the cache with data and coefficients 
in RAM. The data memory for the tapped delay line 
is arranged as follows: x1(0), x2(0), x1(-1), x2(-1), 

x1(-7), x2(-7). The coefficients are arranged 

in memory the same way. The data memory 
organization is shown in Figure 9. The modulo 
addressing capability of the DSP16/DSP16A is most 
suitable for convolution computation and, therefore, 
is used in performing this function. The error used 
in the coefficient update calculation is the difference 
between the input and output of the quantizer. The 
calculation of coefficients follows equations (10) and 
(11). The negative convergence factor, -ji, is stored 
in ROM, and its value is determined by the system 
parameters (i.e., the number of equalizer taps, the 
characteristics of the channel, etc.). 

The difference between the LE and DFE is the 
addition of the feedback equalizer in the DFE. 
Therefore, the programming of the DFE involves 
switching tasks between the forward and feedback 
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Figure 7= Complex Decision Feedback Equalizer Block Diagram 
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Figure 8. Complex Transversal Filters in the DFE 
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datl: 


rb-> 
re-> 


x1(0) 


dat2: 


x2(0) 




X1(-1) 




x2(-1) 




© m 
m m 
m m 




x1(-6) 




x2(-6) 


edatl: 


x1(-7) 


edat2: 


x2(-7) 






erM: 


e1 


err2: 


e2 


trh: 


-0.2 




0.0 




0.2 


qtz: 


-0.3 




-0.1 




0.1 




0.3 



<r- M coef 1 : 


d 


coef2: 


C2 Q 




c1 1 




c2 1 




m m 
© m 

m m 




d 6 




C2 6 




C1 ? 




C2 y 






but: 













dt1: 



Figure 9. Data Memory Organization of Linear 
Equalizer 

equalizers. Pointers rb, re and M are used to 
configure the forward tapped delay line and must be 
saved for the next data input. The new pointer 
settings must be transferred to these registers 
before the feedback equalizer can be computed. 
Using the compound addressing feature provided by 
DSP16/DSP16A, such memory and register content 
swaps can easily be accomplished. The data 
memory organization of the DFE is shown in Figure 
10. 

The 2-bit quantizer is implemented as a subroutine 
which can be called to obtain both signals d1 and 
d2. Also, it can easily be replaced by a different 
subroutine (i.e., 3-bit quantizer in a 64-QAM case) 
without altering the main program. The complete 
program listings for the LE and DFE are given in 
Programs 3 and 4, respectively. 



edtl: 
dt2: 



edt2: 



x1(0) 


cf1: 

ecfl: 
cf2: 

ecf2: 


d 


trh: 
qtz: 

pnt: 
but: 

err: 


-0.2 


x2(0) 


c2 


0.0 


X1(-1) 


c1 1 


0.2 


x2(-1) 


c2 1 




-0.3 


x1(-2) 


d 2 


-0.1 


x2(-2) 


C2 2 


0.1 


d1(-1) 


b1 i 


0.31 


d2(-1) 


b2 i 




rb 


d1(-2) 


m 2 


rb 


d2(-2) 


b2 2 


re 


d1(-3) 


b1 3 




x2(1) 


d2(-3) 


b2 3 


X1(1) 


d1(-4) 


bi 4 


z2(0) 


d2(-4) 


b2 4 


d1(0) 


d1(-5) 


b1 5 


d2(0) 


d2(-5) 


b2 5 


e1 






e2 



Figure 10. Data Memory Organization of 
Decision Feedback Equalizer 

Testing ©f the Equalizer 

A computer-generated test signal was applied to the 
input of the equalizer. A CCITT V.22 bis-compatible 
signal, which is a 16-point QAM signal, was used in 
this example. Its baseband pulse was shaped to a 
75% raised cosine frequency spectrum. Figure 11 
shows the eye-pattern plot of the baseband signal. 
The modulating carrier is 2400 Hz, which allows the 
transmission of 600 baud data through the higher 
frequency band of the voiceband telephone channel. 
The simulated channel represents the worst-case 
long-haul connection, according to the delay and 
amplitude distortion data published by a 1969 
connection survey [3]. 

The LE and DFE programs were run under the 
envl 6 environment (an integrated DSP16/DSP16A 
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Figure 11. Eye Pattern ©f Baseband Data Signal 

assembler and simulator) to obtain the following 
results. Figures 12 and 13 show the learning curves 
of the adaptation process in which the ratio of error 
power, e1 2 + e2 2 , and the average transmitted 
signal power are plotted against the number of 
iterations. The constellation plots of the input and 
output of the equalizer (Figures 14—18) 
demonstrate the effectiveness of the equalizer in 
improving the ISI caused by channel distortions. 

For the LE, there are a maximum of 239 cycles 
required to process each data point and 85 cycles to 
initialize the program. The entire program occupies 
132 ROM words and 43 RAM words. The decision 
feedback equalizer takes slightly more execution 
cycles (265) and memory spaces (178 ROM words 
and 48 RAM words). 

Conclusion 

Implementations of an 8-tap complex linear 
equalizer and a decision feedback equalizer of 3 
forward taps and 5 feedback taps using the 
DSP16/DSP16A DSP are given in this application 
note. Using a 55 ns processor, the throughputs of 
the LE and DFE are under 14 |is and 15 jjs, 
respectively. The misadjustments of both adaptive 
equalizers at steady state are very low even though 
the full dynamic range ( -2 < X < 2-2" 14 of Q14 
format; signal range is -0.4 < X < 0.4) is not utilized. 



This low level of misadjustment is achieved by 
taking advantage of the two 36-bit accumulators and 
by using rounding instead of truncation (default 
operation) prior to the transfer of data from an 
accumulator to data memory or other 16-bit 
registers. The execution of the LMS complex 
adaptive filter alone takes 17 cycles per tap 
including rounding the new coefficient and writing it 
to the data memory. 
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Figure 12. Learning Curve (LE) 
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Figure 14. Constellation Plot of Input Data to Adaptive 
Equalizer (1200 samples) 
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Figure 15. Constellation Plot of LE Output (data 1—600) 
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Figure 16. Constellation Plot of LE Output (data 601—1200) 
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Figure 17. Constellation Plot of DFE Output (data 1—600) 



19 



Digital Eeh® GaneeSeir and Complex Adaptive Equalizers 



f 








# 


♦ 


♦ 








c 





♦ 





4 


♦ 


# 


# 



Figure 18. Constellation Plot of DFE Output (data 601—1200) 
Program Listing 3, Complex Linear Equalizer 



/* 
/* 

/* by S. 



. ram 

datl : 

dat2: 
edatl : 
edat2 : 

errl : 

err2 : 

trh: 

qtz : 

coef 1 : 

coef 2 : 

ecef 1 : 

ecef 2 : 

buf : 

. endram 



*/ 

Complex Linear Equalizer rev. 1.0 */ 

Tu, AT&T Bell Laboratories, Allentown, PA 12/20/88 */ 



int 

13*int 

int 

int 

int 

int 

3*int 

4* int 

int 

13*int 

int 

int 

3*int 



start: auc=0x02 
pt=CI 
r2=coefl 
do 16 { 

*r2++=x 
} 

r0=trh 
do 7 { 

*r0++=x 
} 



/* allocate 16 ram spaces for*/ 
/* 8-tap complex data */ 



/* errl - real error signal */ 

/* err2 -imag. error signal */ 

/* decision threshold levels */ 

/* quantized signal levels */ 

/* allocate 16 ram spaces for */ 

/* 8-tap complex coefficients */ 



/* temporary data storage 



y=a0 x=*pt++ 



y=a0 



x=*pt++ 



*/ 



/* set alignment for Q14 format 

/* pt -> coef. in rom 

/* r2 -> coef. in ram 

/* transfer coefficients from 

/* rom to ram (y=a0 is a dummy 

/* fetch) 

/* rO -> threshold in ram 

/* transfer threshold levels 

/* and quantization levels 

/* from rom to ram 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
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rl=datl 
rb=datl 
re=edat2 

J-2 
i=0 



/* rl -> data pointer 
/* set virtual shifting 
/* registers rb, re 



*/ 
*/ 

*/ 



loop: rO=buf 

*rO++=sdx 
r2=coef 1 
r3=coef2 



cnvl : 



aO=p 

aO=aO-p 

al=aO 



p=x*y 



do 7 { 



x=*r2 
y=*rl++ 

x=*r3++j 

y=*rl++ 

x=*r2++j 

y=*rl++ 

x=*r2 

x=*r3++j 

y=*rl++ 
x=*r2++j 



aO=aO+p p=x*y 
al=al+p p=x*y 
aO=aO-p p=x*y 
al=al+p 

p=x*y 

} 

aO=aO+p p=x*y 

al=al+p p=x*y 

aO=aO-p p=x*y 

al=al+p 

aO=rnd(aO) 
al=rnd (al) 

*rO++=sdx 

*rO=al 

sdx=aO 



/* error signal calculation of 16-QAM case 
r3=errl 
call dec 
y=*r2 
aO=aO-y 

y=aO x=*pt++ 
p=x*y 
aO=p 

aO=rnd(aO) 
*r3++=a0 

aO=al 
call dec 
y=*r2 
aO=aO-y 



y=aO x=*pt++i 



p=x*y 



aO=p 

aO=rnd(aO) 

*r3=a0 



/* main loop 

/* input real data from SIO 



/* convolution 

/* x = cl 

/* clear aO; y = xl 

/* clear al 

/* p = xl*cl; x = c2 

/* p = xl*c2; y = x2 

/* p = x2*c2; x = cl 

/* p = x2*cl; y = xl (-1) 

/* x = cl_l 

/* p = xl(-l)*cl_l; x = c2_l 



/* roundoff equalizer output 

/* aO,al to 16-bit word 

/* input imag. data from SIO 

/* store imag. output data 

/* output real data to SIO 



*/ 
/* 
/* 
/* 
/* 
/* 



r3 -> error signal (real) 
call subroutine dec 
fetch quantized level dl 
calculate -el = yl - dl 
fetch convergence factor u 



/* calculate el 



-u* (-el) 



*/ 

*/ 



*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 

*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



/* write el to ram (errl) */ 
/* repeat for e2 calculation */ 



/* write e2 to ram (err2) 
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/* coefficient adaptation using the LMS algorithm */ 

r2=errl /* r2 -> errl 

r3=err2 /* r3 -> err2 

rO=coefl /* rO -> coefl 

lms: x=*r2 /* x = el 

y=*rl++ /* y = xl 



do 7 { 



p=x*y 


x=*r3 


aO=p p=x*y 


y=*rl++ 


al=p p=x*y 


x=*r2 


aO=aO+p p=x*y 


y=*rO++ 


aO=rnd(aO) 




aO=aO+y 


y=*rO — 


al=al-p 


*rO++=aO 


al=rnd (al) 




al=al+y 


y=*rl++ 


*rO++=al 




} 




p=x*y 


x=*r3 


aO=p p=x*y 


y=*rl — 


al=p p=x*y 


x=*r2 


aO=aO+p p=x*y 


y=*rO++ 


aO=rnd(aO) 




aO=aO+y 


y=*rO — 


al=al-p 


*rO++=aO 


al=rnd (al ) 




al=al+y 




*rO++=al 




y=*rO++ 




*rl++=y 




y=*rO++ 




*rl — =y 




sdx=*rO 




goto loop 




/* subroutine: dec - 2- 


-bit quant 


dec : rO=trh 




r2=qtz 




y=*rO++ 




aO-y 




if le return 




*r2++ 




y=*rO++ 




aO-y 




if le return 




*r2 + + 




y=*rO 




aO-y 




if le return 




*r2++ 




return 





/* p = el*xl; x = e2 

/* p = e2*xl; y = x2 

/* p = e2*x2; x = el 

/* p = el*x2; y = cl 

/* aO = cl (n+1) ; y = c2 

/* update cl 

/* al = c2 (n+1) ; y = xl 

/* update c2 



/* transfer new data to delay 
/* line 



/* output imag. data to SIO 

/* return to main loop 

*/ 

/* rO -> trh 

/* r2 -> qtz 

/* fetch threshold level -0. 

/* to y and compare with aO 

/* if greater, increment r2 



/* compare with threshold 0, 
/* if greater, increment r2 



/* compare with threshold 0.2 
/* if greater, increment r2 



*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 

*/ 
*/ 

*/ 
*/ 



*/ 
*/ 



*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 



*/ 
*/ 
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CI: 


int 





CQ: 


int 







int 







int 







int 







int 







int 


1 




int 







int 







int 







int 







int 







int 







int 







int 







int 





TRH: 


int 


-0 




int 







int 





QTZ: 


int 


-0 




int 


-0 




int 







int 





U: 


int 


-0 



/* start of real coef . */ 
/* start of imaginary coef. */ 



,2 /* threshold levels */ 

,0 

,2 

,3 /* quantization levels */ 

.1 

,1 

,3 

,0625 /* -( convergence factor ) */ 



Program Listing 4. Complex Decision Feedback Equalizer 



/* */ 

/* Complex Decision Feedback Equalizer rev. 1.0 */ 
/* by S. Tu, AT&T Bell Laboratories, Allentown, PA 1/10/89 */ 



. ram 

dtl: 
edtl: 

dt2: 
edt2 : 

cfl: 
ecfl : 

cf2: 
ecf2 : 

trh: 

qtz : 

pnt : 

buf : 

err : 

. endram 



5* int 

int 
9* int 

int 
5* int 

int 
9*int 

int 
3*int 
4*int 
3*int 
5*int 
2* int 



start 



auc=0x02 

J-2 
i=0 
pt=CI 
r0=cfl 
do 23 { 

*r0++=x 

} 

rl=dt2 

rb=dt2 

re=edt2 

*r0++=rl 

*r0++=rb 

*r0++j=re 



/* 
/* 
/* 
/* 



Allocate 6 RAM spaces for 
3-tap forward equalizer. 
Allocate 10 RAM spaces for 
5-tap feedback equalizer. 



/* Allocate 6 RAM spaces for 

/* 3-tap forward equalizer coef. 

/* Allocate 10 RAM spaces for 

/* 5-tap feedback equalizer coef. 

/* decision threshold level 

/* quantized signal level 

/* pointer rl,rb and re 

/* temporary data storage buffer 

/* error signal 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



y=a0 x=*pt++ 



/* set alignment for Q14 format 



/* pt -> coef. in ROM */ 

/* r2 -> coef. in RAM */ 

/* Transfer coefficients and */ 

/* constants from ROM to RAM */ 

/* (y=a0 is a dummy fetch) . */ 

/* rl -> data pointer */ 

/* Configure tapped delay line */ 

/* using rb, re for feedback */ 

/* equalizer, save pointers */ 

/* in RAM. */ 
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rl = 


=dtl 














rb= 


=dtl 














re= 


=edtl 












loop: 


*rO=sdx 














r2 = 


=cfl 












cnvll : 


a0= 


=P 








x= 


=*r2++ 




a0= 


=aO-p 








y= 


=*rl++ 




al = 


=a0 


















P= 


=x 


*y 


x= 


=*r2 — 


do 


2 { 
















a0= 


=aO+p 


P= 


=x 


*y 


y= 


=*rl++ 




al = 


=al+p 


P= 


=x 


*y 


x= 


=*r2++j 




a0= 


=aO-p 


P= 


=x 


*y 


y= 


=*rl++ 




al = 


=al+p 








x= 


=*r2++ 








P= 


=x 


*y 


x= 


=*r2 — 



} 



aO=aO+p p=x*y y=*rl++ 

al=al+p p=x*y x=*r2++j 
aO=aO-p p=x*y 

al=al+p x=*r2++ 

rO=pnt 
*rOzp : rl 
*rOzp : rb 
*rOzp : re 



cnvl2 : 



y=*rl++ 



p=x*y x=*r2- 



do 4 { 



y=*rl++ 
x=*r2++j 
y=*rl++ 
x=*r2++ 
x =* r 2 — 

y=*rl++ 
x=*r2++j 



aO=aO+p p=x*y 
al=al+p p=x*y 
aO=aO-p p=x*y 
al=al+p 

p=x*y 
} 

aO=aO+p p=x*y 
al=al+p p=x*y 
aO=aO-p p=x*y 
al=al-l-p 

aO=rnd(aO) 
al=rnd(al) 

*rO++j=sdx 

sdx=aO 

*rO++=al 



/* error signal calculation for 16-QAM case 
call dec 



y=*r2 
aO=aO-y 

p=x*y 
aO=p 

aO=rnd(aO) 
*r0 — =a0 

aO=al 



*rO++j=y 
y=aO x= 



*pt++i 



/* Configure tapped delay line 
/* for forward equalizer. 



/* main loop 

/* input real data from SIO 

/* convolution - forward eqlzr 

/* x = cl 

/* clear aO; y=xl 

/* clear al 

/* p = xl*cl; x = c2 

/* p = xl*c2; y = x2 

/* p = x2*c2; x = cl 

/* p = x2*cl; y = xl (-1) 

/* x = cl_l 

/* p = xl(-l)*cl_l; x = c2_l 



/* x = bl 



/* Exchange rb, re, rl 

/* contents with data in RAM 

/* for the feedback equalizer 

/* convolution. 

/* y = dl 

/* p = dl*bl; x = b2 

/* p = dl*b2; y = d2 

/* p = d2*b2; x = bl 

/* p = d2*bl; y = dl (-1) 

/* x = bl_l 

/* p = dl(-l)*bl_l; x = b2_l 



/* Roundoff equalizer output 

/* aO, al to 16-bit word. 

/* input imag. data from SIO 

/* output real data to SIO 

/* Save imag. output data 

/* for later writing to sdx. 



*/ 
/* 
/* 
/* 
/* 
/* 



call subroutine dec 
fetch quantized level dl 
-el = yl-dl; save dl 
fetch convergence factor u 



el 



(~el) 



/* round el to 16 bit 
/* save el in ram (errl) 

/* repeat for e2 calculation 



*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 



*/ 
*/ 

*/ 
*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
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call dec 

y=*r2 

aO=aO-y 



*rO++j=y 

y=aO x=*pt++i 



p=x*y 



aO=p 

aO=rnd(aO) 
*r0 aO 

/* feedback equalizer coefficient adaptation using the LMS algorithm 
r2=err 



lms 1 : 



r3=cf2 

x=*r2++ 

y=*rl++ 



do 4 { 



p=x 


*y 


x=*r2 — 


aO=p p=x 


*y 


y=*rl++ 


al=p p=x 


*y 


x=*r2++ 


aO=aO+p p=x 


*y 


y=*r3++ 


aO=rnd(aO) 






aO=aO+y 




y=*r3 — 


al=al-p 




*r3++=a0 


al=rnd (al) 






al=al+y 




y=*rl++ 


*r3++=al 

} 

p=x 






*y 


x=*r2 — 


aO=p p=x 


*y 


y=*rl 


al=p p=x 


*y 


x=*r2 


aO=aO+p p=x 


*y 


y=*r3++ 


aO=rnd(aO) 






aO=aO+y 




y=*r3 — 


al=al-p 




*r3++=a0 


al=rnd (al) 






al=al+y 




*r0 — 


*r3++=al 






y=*rO — 






*rl — =y 






y=*rO 






*rl=y 






rO=pnt 






*rOzp:rl 






*rOzp: rb 






*rOzp: re 







/* 


x = el 




/* 


y = dl 




/* 


p = el*dl; 


x = e2 


/* 


p = e2*dl; 


y = d2 


/* 


p = e2*d2; 


x = el 


/* 


p = el*d2; 


y = bl 



/* dbl=rnd(el*dl+e2*d2) 
/* bl (l)=bl (0)+dbl; y = 
/* update bl in ram 



b2 



/* db2=rnd(e2*dl-el*d2) 

/* b2 (l)=b2 (0)+db2; y = dl 

/* update b2 in ram 



/* Update the feedback eqlzr 
/* delay line elements dl (0) r 
/* d2 (0) . 



/* Exchange the pointer 

/* settings for the forward 

/* equalizer tap update. 



*/ 



*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 

*/ 
*/ 
*/ 



*/ 
*/ 
*/ 



*/ 
*/ 
*/ 
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/* forward equalizer coefficient adaptation using the LMS algorithm 



lms2 : r3=cf 1 






x=*r2++ 






y=*rl++ 






do 2 { 






p=x 


*y 


x=*r2 — 


aO=p p=x 


*y 


y=*rl++ 


al=p p=x 


*y 


x=*r2++ 


aO=aO+p p=x 


*y 


y=*r3++ 


aO=rnd(aO) 






aO=aO+y 




y=*r3 — 


al=al-p 




*r3++=a0 


al=rnd (al) 






al=al+y 




y=*rl++ 


*r3++=al 
} 






p=x^ 


*y 


x=*r2 — 


aO=p p=x : 


*y 


y=*rl — 


al=p p=x : 


*y 


x=*r2++ 


aO=aO+p p=x : 


k Y 


y=*r3++ 


aO=rnd(aO) 






aO=aO+y 




y=*r3 — 


al=al-p 




*r3++=a0 


al=rnd (al) 






al=al+y 




*r0++ 


*r3++=al 






y=*rO — 






*rl++=y 






y =*r0++ j 






*rl — =y 






sdx=*rO — 






goto loop 






/* subroutine: dec - 


- 2- 


-bit quantizer 


dec: r3=trh 






r2=qtz 






y=*r3++ 






aO-y 






if le return 





/* 


X 


= 


el 








/* 


y 


= 


xl 








/* 


p 


= 


el*xl; 


X 


= 


e2 


/* 


p 


= 


e2*xl; 


y 


= 


x2 


/* 


p 


= 


e2*x2; 


X 


= 


el 


/* 


p 


= 


el*x2; 


y 


= 


cl 



/* del = rnd(el*xl+e2*x2) 

/* cl(l) = cl(0)+dcl; y = c2 

/* update cl in ram 

/* dc2 = rnd(e2*xl-el*x2) 

/* c2(l) = c2(0)+dc2; y = xl 

/* update c2 in ram 



*/ 
*/ 

*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 
*/ 



/* Transfer new data to forward */ 

/* equalizer delay line elements */ 

/* xl(0) , x2 (0) . */ 

/* output imag. data to SIO */ 

/* return to main loop */ 

*/ 

/* r3 -> trh */ 

/* r2 -> qtz */ 

/* fetch threshold level -0.2 */ 

/* to y and compare with aO */ 

/* if greater, increment r2 */ 
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*r2++ 






y=*r3++ 




aO-y 






if le 


return 




*r2++ 






y=*r3 






aO-y 






if le 


return 




*r2++ 






return 


CI: 


int 





CQ: 


int 







int 







int 







int 


1.0 




int 





BCI: 


int 





BCQ: 


int 







int 







int 







int 







int 







int 







int 







int 







int 





TRH: 


int 


-0.2 




int 


0.0 




int 


0.2 


QTZ: 


int 


-0.3 




int 


-0.1 




int 


0.1 




int 


0.3 


U: 


int 


-0.05 



/* compare with threshold 0, */ 
/* if greater, increment r2 */ 



/* compare with threshold 0.2 */ 
/* if greater, increment r2 */ 



/* forward equalizer coef. 



/* feedback equalizer coef. 



/* threshold levels 



/* quantization levels 



/* - (convergence factor) 



*/ 



*/ 
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